NISP: Pruning Networks using Neuron Importance Score Propagation

نویسندگان

  • Ruichi Yu
  • Ang Li
  • Chun-Fu Chen
  • Jui-Hsin Lai
  • Vlad I. Morariu
  • Xintong Han
  • Mingfei Gao
  • Ching-Yung Lin
  • Larry S. Davis
چکیده

To reduce the significant redundancy in deep Convolutional Neural Networks (CNNs), most existing methods prune neurons by only considering statistics of an individual layer or two consecutive layers (e.g., prune one layer to minimize the reconstruction error of the next layer), ignoring the effect of error propagation in deep networks. In contrast, we argue that it is essential to prune neurons in the entire neuron network jointly based on a unified goal: minimizing the reconstruction error of important responses in the “final response layer” (FRL), which is the secondto-last layer before classification, for a pruned network to retrain its predictive power. Specifically, we apply feature ranking techniques to measure the importance of each neuron in the FRL, and formulate network pruning as a binary integer optimization problem and derive a closed-form solution to it for pruning neurons in earlier layers. Based on our theoretical analysis, we propose the Neuron Importance Score Propagation (NISP) algorithm to propagate the importance scores of final responses to every neuron in the network. The CNN is pruned by removing neurons with least importance, and then fine-tuned to retain its predictive power. NISP is evaluated on several datasets with multiple CNN models and demonstrated to achieve significant acceleration and compression with negligible accuracy loss.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Node Pruning Based on Entropy of Weights and Node Activity for Small-Footprint Acoustic Model Based on Deep Neural Networks

This paper describes a node-pruning method for an acoustic model based on deep neural networks (DNNs). Node pruning is a promising method to reduce the memory usage and computational cost of DNNs. A score function is defined to measure the importance of each node, and less important nodes are pruned. The entropy of the activity of each node has been used as a score function to find nodes with o...

متن کامل

Neuron Pruning for Compressing Deep Networks Using Maxout Architectures

This paper presents an efficient and robust approach for reducing the size of deep neural networks by pruning entire neurons. It exploits maxout units for combining neurons into more complex convex functions and it makes use of a local relevance measurement that ranks neurons according to their activation on the training set for pruning them. Additionally, a parameter reduction comparison betwe...

متن کامل

Pruning of RBF Networks in Robot Manipulator Learning Control

Radial Basis Function Neural Networks are well suited for learning the system dynamics of a robot manipulator and implementation of these networks in the control scheme for a manipulator is a good way to deal with the system uncertainties and modeling errors which often occur. The problem with RBF networks however is to find a network with suitable size, not too computational demanding and able...

متن کامل

Variable Structure Neural Networks for Adaptive Robust Control Using Evolutionary Artificial Potential Fields

A novel neural network architecture, is proposed and shown to be useful in approximating the unknown nonlinearities of dynamical systems. In the variable structure neural network, the number of basis functions can be either increased or decreased with time according to specified design strategies so that the network will not overfit or underfit the data set. Based on the Gaussian radial basis f...

متن کامل

Pruning Using Parameter and Neuronal Metrics

In this article, we introduce a measure of optimality for architecture selection algorithms for neural networks: the distance from the original network to the new network in a metric defined by the probability distributions of all possible networks. We derive two pruning algorithms, one based on a metric in parameter space and the other based on a metric in neuron space, which are closely relat...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1711.05908  شماره 

صفحات  -

تاریخ انتشار 2017